Universally Measurable Strategies in Zero-Sum Stochastic Games
نویسندگان
چکیده
منابع مشابه
Definable Zero-Sum Stochastic Games
Definable zero-sum stochastic games involve a finite number of states and action sets, reward and transition functions that are definable in an o-minimal structure. Prominent examples of such games are finite, semi-algebraic or globally subanalytic stochastic games. We prove that the Shapley operator of any definable stochastic game with separable transition and reward functions is definable in...
متن کاملReversibility and Oscillations in Zero-sum Discounted Stochastic Games
We show that by coupling two well-behaved exit-time problems one can construct two-person zero-sum stochastic games with finite state space having oscillating discounted values. This unifies and generalizes recent examples due to Vigeral (2013) and Ziliotto (2013).
متن کاملAlmost Stationary (-Equilibria in Zero-Sum Stochastic Games
We show the existence of almost stationary (-equilibria, for all (H0, in zero-sum stochastic games with finite state and action spaces. These are (-equilibria with the property that, if neither player deviates, then stationary strategies are played forever with probability almost 1. The proof is based on the construction of specific stationary strategy pairs, with corresponding rewards equal to...
متن کاملConvergence of “Best-response Dynamics” in Zero-sum Stochastic Games
Given a two-player zero-sum discounted-payoff stochastic game, we introduce three classes of continuous-time best-response dynamics, stopping-time best-response dynamics, closed-loop best-response dynamics, and open-loop best-response dynamics. We show the global convergence of the first two classes to the set of minimax strategy profiles, and the convergence of the last class when the players ...
متن کاملSecurity Strategies of Both Players in Asymmetric Information Zero-Sum Stochastic Games with an Informed Controller
This paper considers a zero-sum two-player asymmetric information stochastic game where only one player knows the system state, and the transition law is controlled by the informed player only. For the informed player, it has been shown that the security strategy only depends on the belief and the current stage. We provide LP formulations whose size is only linear in the size of the uninformed ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: The Annals of Probability
سال: 1985
ISSN: 0091-1798
DOI: 10.1214/aop/1176993080